The Avivavoz Phrase-based Statistical Machine Translation System for Albayzin
نویسندگان
چکیده
This paper describe the SMT system developed by the TALP Research Center of the UPC (Universitat Politècnica de Catalunya) for the Albayzin 2008 evaluation campaign (Spanish to Basque translation task). Apart from a standard set of feature models, the system introduces two target language models: one based on lemmas and the second based on linguistic classes (Part-of-Speech). The word-to-word alignment was obtained using a segmented version of the Basque corpus. The results obtained over the development and test sets are analyzed and discussed.
منابع مشابه
The RWTH Aachen German-English Machine Translation System for WMT 2014
This paper describes the statistical machine translation (SMT) systems developed at RWTH Aachen University for the German→English translation task of the ACL 2014 Eighth Workshop on Statistical Machine Translation (WMT 2014). Both hierarchical and phrase-based SMT systems are applied employing hierarchical phrase reordering and word class language models. For the phrase-based system, we run dis...
متن کاملThe ISL Phrase-Based MT System for the 2007 ACL Workshop on Statistical Machine Translation
In this paper we describe the Interactive Systems Laboratories (ISL) phrase-based machine translation system used in the shared task ”Machine Translation for European Languages” of the ACL 2007 Workshop on Statistical Machine Translation. We present results for a system combination of the ISL syntax-augmented MT system and the ISL phrase-based system by combining and rescoring the n-best lists ...
متن کاملEnriching a statistical machine translation system trained on small parallel corpora with rule-based bilingual phrases
In this paper, we present a new hybridisation approach consisting of enriching the phrase table of a phrase-based statistical machine translation system with bilingual phrase pairs matching structural transfer rules and dictionary entries from a shallowtransfer rule-based machine translation system. We have tested this approach on different small parallel corpora scenarios, where pure statistic...
متن کاملEdinburgh's Statistical Machine Translation Systems for WMT16
This paper describes the University of Edinburgh’s phrase-based and syntax-based submissions to the shared translation tasks of the ACL 2016 First Conference on Machine Translation (WMT16). We submitted five phrase-based and five syntaxbased systems for the news task, plus one phrase-based system for the biomedical task.
متن کاملNUT-NTT statistical machine translation system for IWSLT 2005
In this paper, we present a novel distortion model for phrase-based statistical machine translation. Unlike the previous phrase distortion models whose role is to simply penalize nonmonotonic alignments[1, 2], the new model assigns the probability of relative position between two source language phrases aligned to the two adjacent target language phrases. The phrase translation probabilities an...
متن کامل